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Abstract: Overdispersed count data are modelled with likelihood and 
non-likelihood approaches. Likelihood approaches include the Poisson mix- 
tures with three distributions, the gamma, the lognormal, and the inverse 
Gaussian distributions. Non-likelihood approaches include the robust sand- 
wich estimator and quasilikelihood. In this simulation study, overdispersed 
count data were simulated under the Poisson mixtures with the gamma, 
the lognormal and the inverse Gaussian distributions, then analyzed with 
the five likelihood and non-likelihood approaches. Our results indicated 
that 1) when the count data are mildly overdispersed, there are virtu- 
ally no differences in type I error rate, standard error of the main effect, 
and empirical power among the five methods; 2) when the count data are 
very overdispersed, none of these five approaches is robust to model mis- 
specification as evaluated by type I error rate, standard error of the main 
effect, and empirical power. This simulation study raises caution on us- 
ing non-likelihood method for analyzing very overdispered count data be- 
cause of likely higher type I error and inappropriate power levels. Unlike 
non-likelihood approaches, likelihood approaches allow for statistical tests 
based on likelihood ratios and for checking model fit and provide basis for 
power and sample size calculations. When likelihood approaches are used, 
we suggest comparing likelihood values to select the appropriate parametric 
method for analyzing very overdispersed count data. 

AMS 2000 subject classifications: Primary 60K35, 60K35; secondary 
60K35. 

Keywords and phrases: Poisson mixture, Overdispersion, Gamma, Log- 
normal, Inverse Gaussian, Maximum likehood estimate. 
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Poisson regressions are widely used in analyzing count data which often arise 
in medical research such as counts of adverse events, number of episodes of a 
chronic disease, and utilization of a health care system (Neas, Schwartz, and 
Dockery, 1999; Thall and Vail, 1990; Estabrooks, Nelson, Xu, et al, 2005; Sills, 
Shetterly, Xu, et al, 2007). A complication in Poisson regressions is that the 
assumption of mean equal to variance may be violated. Although it is rare, 
underdispersion (variance less than mean) can happen (Efron, 1986; Faddy, 
1997; Ridout and Besbeas, 2004). More commonly, if the variance is greater than 
the mean, the count data are overdispersed. Failure to deal with the dispersion 
results in false conclusions, often falsely rejecting the null hypothesis although 
the parameter estimates from maximum likelihood are consistent and efficient 
(Paul and Plackeet, 1978; Cox, 1983). 

There are two classes of approaches to deal with overdispersion in count 
data analyses: non-likelihood and likelihood. Non-likelihood approaches such as 
the robust sandwich estimator (Huber, 1967; White, 1982) and quasilikelihood 
(McCullagh and Nelder, 1989) do not assume a probability distribution of the 
unobserved heterogeneity which causes overdispersion. Likelihood approaches 
use a Poisson mixture with a non-negative variable, v, which is often called the 
extra-variation factor. Because of its importance and wide use, a log link func- 
tion between independent variables and the mean of the dependent variable is 
assumed in this paper. Let jij = exp(xj/3) where /3 is a column vector of the 
fixed regression coefficients to be estimated, and Xj is a row vector of covariates 
for the jth subject, /ij is the Poisson distribution parameter without overdisper- 
sion. With overdispersion, the Poisson distribution parameter can be modeled 
as (Cameron and Trivedi, 1998, plOO), 

^3 = = exp(xj/3). (1) 

Different distributions of v with mean 1 and variance have been proposed to 
take into account overdispersion when count data are analyzed, including the 
gamma distribution (Manton, Woodbury, and Stallard 1981; Margolin, Kaplan, 
and Zeiger 1981), lognormal distribution (Hinde, 1982), and inverse Gaussian 
distribution (Sichel, 1971, 1973 and 1974; Houggard, Lee, and Whitmore, 1997). 

However, only a few studies have compared these methods based on limited 
evaluation measures (Breslow, 1990; Lee and Nelder, 2000). There is not a sys- 
tematic comparison of all these likelihood and non-likelihood approaches based 
on broad evaluation measures. Theoretical comparison is not available because 
there are no closed form solution for coefficients for methods such as likelihood 
methods requiring numerical integration and non-likelihood methods. To ad- 
dress the gap in the current literature, we conducted a simulation study which 
evaluates these approaches based on type I error rate, bias, standard error of 
main eff'ect, and empirical power. The likelihood and non-likelihood methods 
are reviewed in section 2. In section 3, count data are simulated under various 
assumptions and analyzed with these five approaches. 

2. Overdispersion models 
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Let y have a Poisson distribution with parameter A as in ([1]). Then the mean 
and variance of y are E(?/) = ji and Var(?/) = /i + Var(j/)/i^. 

2.1. Gamma distribution of v 

Let in ([1]) have a gamma distribution with mean 1 and Var(^') = t, 

qMt) — — — T-i'^^^^^ exp(~-). 
' ^ r(i)rT ^ t' 

The marginal distribution of y is 

^(yM:^)= / i gii^lTjdi^^ — — r 

J y'- 2;!r(i)(l + r/i)^+- 

which is the negative binomial distribution. This well known result is derived 
by Cameron and Trivedi (1998) citing references back to 1920. 

When r — > 0, the negative binomial distribution approaches the Poisson 
distribution. For an observation, yj, the log likelihood {II j) is 

llj = yj log(r^j ) - {yj + 1/r) log(l + t^Ij ) 

+ log(r(2;, + 1/r)) - log(r(l/r)) - log{T{y, + 1)). 

This is a useful computing formula since most computing packages have the 
log of a gamma function as a standard function. Lawless (1987) examined the 
efficiency and robustness properties of this Poisson mixture with v following a 
gamma distribution (GM method). 

2. 2. Lognormal distribution of v 

Another distribution for the extra-variation factor, v., in ([T|) is lognormal. Few 
studies have been carried out using the lognormal distribution for this purpose 
(Hinde, 1982). However, it has been used in analyzing correlated non-Gaussian 
variables (Stiratelli, Laird and Ware, 1984; Pinheiro and Bates, 1995). By in- 

2 

eluding a random coefficient, 7^, and an unknown constant, in the fixed 
effect as in equation ^ , the potential overdispersion in count data is taken into 
account. 7 is assumed to be iV(0, a"^). Then 

Aj = exp(xj/3 + 7j - y) = Vj^i^ (2) 

where = exp(7j — ^) has a lognormal distribution with mean equal to 1 
and Var(j/) = exp{a^) — 1 based on the properties of the lognormal distribution 
(Casella and Berger, 1990, ppllO-111). 



Imsart-generic ver. 2008/01/24 file: ejs_2008_291.tex date: September 5, 2008 



Stanley Xu/ Analyzing very dispersed count data 



4 



The above form of \j in ([2]) is very similar to the negative binomial distri- 
bution except that Vj = exp(7j — ^) is a lognormal distribution here, while Uj 
is a gamma distribution in the negative binomial case. In fact, the lognormal 
distribution is similar in shape to the gamma distribution (Casella and Berger, 
1990, pill). 

The marginal probability density function of the observation has no closed 
form but can be approximated by numerical integration (Pinheiro and Bates, 
1995) to calculate 



/(y) = / ^1 f(y)dv 



which is equivalent to 



exp[-exp(7)Ml(exp(7).-)- ^ exp(-2,).. 



where /i* = exp(xj/3 — ^). 



2.3. Inverse Gaussian distribution of v 



The inverse Gaussian distribution has been used to handle overdispersion of 
count data (Sichel, 1971, 1973 and 1974; Houggard, Lee, and Whitmore, 1997). 
Let V have an inverse Gaussian distribution (1, ^), 



Then E(i^) = 1 and \ai{v) = a. 

The marginal probability density function of y has no closed form. Shoukri, 
et al. (2004) developed estimating equations to obtain estimates of fixed effects. 
The marginal probability density function of y can be obtained by numerical 
integration. For simplicity, and to obtain a likelihood, we used numerical inte- 
gration in this paper to calculate 



\2 



J yl \ 27rai/3 2ai' ' 

For the three likelihood methods, their log likelihood values are summed over 
subjects and maximum likelihood estimates (MLEs) are obtained numerically. 
We refer to these three likelihood approaches as the GM, LN and IG methods. 



2.4. Sandwich robust estimator 

The robust sandwich variance estimator, which was originally introduced by Ru- 
ber (1967) and White (1982), has been widely used for analyzing overdispersed 
count data. A so-called "working covariance matrix" is used in the estimation 
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step (Zeger and Liang, 1986). Then the "working covariance matrix" is combined 
with its corresponding empirical version in a sandwich estimate form for covari- 
ance estimation. The robust sandwich estimator does not make distributional 
assumptions regarding the unobserved heterogeneity. 

2.5. Quasilikelihood method 

Quasilikelihood is also widely used to account for unobserved heterogeneity in 
count data analysis (McCuUagh and Ncldcr, 1989). The quasilikelihood function 
is obtained by dividing a log-likelihood function for Poisson distribution by a 
dispersion parameter cj). Estimation of /3 and its standard error are computed 
using the quasilikelihood in place of the log likelihood. Quasilikelihood is avail- 
able in commercial software (e.g., SAS PROC GENMOD with dscale option). 
The scale parameter is a fixed parameter which can be estimated from data. 
The variance of y is then 

Var(t/) ~ (f)^. 

(j) < 1 models underdispersion and </> > 1 models overdispersion. Breslow (1990) 
evaluated its performance in analyzing overdispersed Poisson regression. 

The moments of the mean (A) and marginal outcome {y) for these three 
likelihood based methods are shown in Table 1. All of the marginal outcomes 
have quadratic variance function of the form Var{y) = E{y) + cE{y)'^ where 
c = Var{v) is a constant depending on model parameters. We simulated overdis- 
persed count data from the Poisson distribution mixture with three different 
distributions of v but with the same level of overdispersion (same value of c) 
according to Table 1. 

3. Simulation studies 

The simulations are designed to evaluate these five methods of handling overdis- 
persion of count data under different assumptions. 

3.1. Simulation algorithm 

1. Choose the distribution of v from one of the three distributions, the gamma, 
the lognormal and the inverse Gaussian distributions. 

2. Generate the dichotomous independent variable Xj with half the obser- 
vations having zero for the control group and the other half having 1 for the 
treatment group, j=l to 500. 

3. Select values for the intercept (/3o)) the treatment effect (/3i), and the 
parameter for the distribution of v. 

4. Generate independently Vj, j=l to 500, from the selected distribution of v 
in step 1. 

5. Calculate \j = i/j exp(/3o + Xjl3i) and generate randomly yj from the 
Poisson (Aj) distribution. 
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6. Analyze each dataset with each of the five approaches described in section 

2. 

Steps 1-6 are repeated 500 times (500 replications) for each combination of 
parameters. 

3.2. Simulation parameters 

For the fixed effects, we choose 0.693 for /?o in all simulations so that the baseline 
rate is A = cxp(0.693)=2. Wc consider two values for 0i, 0.3 and 0.5. This range 
of j3i will allow us to illustrate the difference in empirical power between the 
five approaches. /3i = is also studied for comparing type I error rates. 

For the distribution of v, five values of the distribution parameter were set 
up to represent different levels of overdispersion. We also use the facts in Table 
1 to generate overdispersed count data with the same level of overdispersion but 
from different distributions of v while /3o and f3i arc fixed. For example, count 
data with u having a gamma distribution with t = 0.5 have the same mean and 
variance as count data with u having a lognormal distribution with = 0.405, 
and count data with v having an inverse Gaussian distribution with a = 0.5. 

3.3. Evaluation measures 

These five approaches arc compared based on the type I error rates, the bias 
of estimates of standard error of /3i, standard deviation of /3i's, empirical 
power, and rates of correct model choice. 

1. Type I error rates. The type I error rates are computed as the proportion of 
p- values less than 0.05 under a null hypothesis of no intervention effect (/3i=0) 
based on Wald tests. 

2. Bias. The bias is calculated as the average of the 500 estimates of /3i 
minus the pre-specified true values. Negative bias represents underestimation of 
the true value and positive bias represents overestimation of the true values. 

3. The mean of standard errors of standard deviation of /?i's, and empirical 
power. The mean of standard errors of /3i is calculated as the average of the 500 
estimates of standard errors of $i. The standard deviation of /3i's is the standard 
deviation of the 500 estimates of Empirical power is the proportion of the 
500 replications with p-values less than 0.05 under the alternative hypothesis 
for a given value of The estimate of /3i and its standard error are factors 
determining the p-value, the significance of difference between the control and 
treatment groups, which is generally the main interest of a study. 

4. Rates of choosing the true models are calculated as the percentage of those 
replications with the -2 log likelihood value using the true distribution of v less 
than the -2 log likelihood value using the other two distributions. 

4. Results 
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4-1. Type I error rate 

Type I error rates are shown in Table 2. When the true distribution of v is 
the gamma, the GM, LN, IG methods, and the robust sandwich estimator yield 
comparable and appropriate type I error rates (around 0.05) while inflated type 
I errors rates for quasilikelihood increase with increasing value of r. When the 
true distribution of v is the lognormal or inverse Gaussian, the LN, IG and robust 
sandwich estimator give similar and appropriate type I error rates (around 0.05) 
while the GM and quasilikelihood methods have larger type I error rates with 
higher levels of over dispersion, in some cases well above 0.05. 

4.2. Bias 

Only when v has a gamma distribution with r > 4, the LN and IG methods 
underestimate j3i (Table 3). The reason for this is imknown. Under other as- 
sumptions, all these five approaches produce ignorable biases. This is expected 
since unobserved heterogeneity is thought to have an impact on the efficiency 
of estimation, not on estimate itself. 

4-3. The mean of standard errors of Pi, standard deviation of Pi 's, and empir- 
ical power. 

The mean of standard errors of /3i, standard deviation of /3i's, and empirical 
power are reported in Tables 4 and 5. The mean of standard errors of $1 and 
standard deviation of Pi 's do not change with different values of Pi . Only Pi = 
0.3 is reported in Table 4. It is thought that standard deviation of /3i's is the 
true representative of the standard errors of pi. Thus the model with smallest 
standard deviation of /3i's and the mean of standard errors of Pi close to that 
is considered the best. When the underlying distribution of v is gamma and 
T > 4, the GM and RS methods are considered as appropriate and they gave 
similar standard errors of Pi and empirical powers. The LN and IG methods 
overestimate the standard errors of Pi . Subsequently the LN and IG methods 
have smaller power which can be up to 0.23 less than the GM method. Although 
the IG method overestimates the standard error of Pi less than the LN method, 
it underestimates Pi more than the LN method. It turns out that the LN and IG 
methods yield a similar level of empirical power. In contrast, the quasilikelihood 
method underestimates the standard error and gives inflated type I error and 
empirical power which can be up to 0.21 greater than the GM method. 

When the underlying distribution of u is lognormal and > 1.609, the LN 
and IG methods are considered appropriate and they gave the similar standard 
errors and empirical powers. The GM and quasilikelihood methods slightly un- 
derestimate the standard errors of $1 and give empirical powers slightly greater 
than the LN method. In contrast, the robust sandwich estimator overestimates 
the standard errors of Pi and gives empirical powers smaller than the LN method 
(up to 0.17). 
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When the underlying distribution of v is inverse Gaussian and a > 4, the LN 
and IG methods arc considered appropriate and they gave the similar standard 
errors and empirical powers. The robust sandwich estimator overestimates the 
standard errors of (3\ and gives empirical powers smaller than the IG method 
(up to 0.15). In contrast, the quasilikelihood method slightly underestimates 
the standard errors of fii and gives empirical powers slightly greater than the 
IG method (up to 0.12). The GM, LN, and IG methods produce comparable 
standard errors of /3i and empirical powers. 

4-'l- Rates of choosing the true models based on -2 log likelihood values 

Rates of choosing the true models based on -2 log likelihood values (Table 6) 
showed that the -2 log likelihood value is a good criterion to determine which 
model is more appropriate, especially when count data are very overdispersed. 
From Tables 2, 3 and 4, that is also when /3i is underestimated, and when the 
standard errors of /3i is under or over-estimated, and when higher type I error 
rate can arise. 

5. Discussion 

In this simulation study, we demonstrate that 1) there is little bias in any of 

the methods considered. Only when the true distribution of ly is gamma and 
Var(j/) > 4.0, the LN and IG methods underestimate 2) different approaches 
do have impact on the estimated standard errors of Pi , and on type I error rate 
and empirical power. Specifically, 3) when the count data are mildly overdis- 
persed (e.g., Var(t^) < 2.0), there are virtually no differences in type I error 
rate, standard error of /3i, and empirical power among the five methods; 4) 
when the count data are very overdispersed (e.g., Var(i/) > 4.0), none of these 
five approaches is robust to misspecification of the underlying distribution of v 
as evaluated by type I error rate, the standard error of /3i, and empirical power; 
5) -2 log likelihood values provide a good way of choosing the method for dealing 
with very overdispersed count data (e.g., Var(i^) > 4.0). With our sample size of 
n=500, over 95 percent of the time, one would choose the true model between 
the GM method and one of the LN or IG methods. The LN and IG methods 
are less distinguishable. 

Because quasilikelihood has high type I error rate and inflated power, and 
the robust sandwich estimator gives smaller empirical power for very overdis- 
persed coimt data, we recommend use of likelihood approaches. Unlike non- 
likelihood approaches, likelihood approaches allow for statistical tests based on 
likelihood ratios and for checking model fit and provide basis for power and sam- 
ple size calculations. There are also practical considerations in choosing among 
the three Poisson distribution mixtures. The likelihood for the GM method can 
be computed using the standard closed form expressions for the negative bino- 
mial distribution, as in Section 2.1. The likelihood for the LN method (Section 
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2.2) requires a numerical integration, but since this method can be viewed as 
a Poisson generalized linear model with log link and a normal random effect 
in the linear predictor, this calculation is available in some standard softwares 
(e.g., SAS PROC NLMIXED). The likelihood for the IG method (Section 2.3) 
also requires a numerical integration which is not available in standard soft- 
ware. Only when a very long and slightly heavy tail exists in count data, the 
inverse Gaussian has some advantage (Wilmot, 1987). This simulation study 
raises caution on using non-likelihood method for analyzing very ovcrdispcred 
count data because of likely higher type I error and inappropriate power levels. 
When likelihood approaches are used, likelihood ratio tests have been used for 
testing overdispersion. Pearson Statistic and Deviance Statistic arc available for 
assessing overall goodness-of-fit of a model for fitting count data (Cameron and 
Trivedi, 1998). However, these three likelihood approaches may not be distin- 
guishable with Pearson Statistic and Deviance Statistic. We propose comparing 
likelihood values to select the appropriate parametric method for analyzing very 
overdispersed count data. This is equivalent to AIC (Akaike, 1974) because the 
number of parameters in these three parametric models is the same. Results 
in Tables 2, 4 and 5 indicate that the standard error of /3i or empirical power 
using the incorrect model are a potential problem when count data are very 
overdispersed, but this approach has an excellent chance of selecting the most 
appropriate model. 
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Table 1 

Moments of the mean (X) and marginal outcome (y) 



moments 




distribution of i/ 




mean or variance 


gamma 


lognormal 


inverse Gaussian 


E(A) 








V{\) 




(exp(a2^-l),i2 




E{y) 








V{y) 


H + T/J? 


H + (exp((72) - 





Stanley Xu/ Analyzing very dispersed count data 



12 



Table 2 

Type I error rates by different distributions of v and different levels of overdispersion based, 

on 500 replications 



true 






likelihood 




non-likeliliood 




dist'n of u 


parm 


GM 


LN 


IG 


RS 


QL 


GM 


T=0.5 


0.046 


0.032 


0.054 


0.054 


0.052 




T=1.0 


0.056 


0.052 


0.024 


0.058 


0.052 




T=2.0 


0.054 


0.042 


0.042 


0.048 


0.086 




T=4.0 


0.042 


0.050 


0.056 


0.050 


0.128 




T=6.0 


0.034 


0.042 


0.046 


0.046 


0.156 


LN 


0-2 = 0.405 


0.050 


0.042 


0.050 


0.042 


0.046 




0-2 = 0.693 


0.072 


0.056 


0.054 


0.054 


0.076 




0-2 = 1.098 


0.088 


0.044 


0.042 


0.056 


0.098 




0-2 = 1.609 


0.130 


0.054 


0.062 


0.050 


0.158 




0-2 = 1.946 


0.130 


0.052 


0.060 


0.058 


0.186 


IG 


a=0.5 


0.052 


0.042 


0.038 


0.058 


0.060 




a=1.0 


0.064 


0.052 


0.044 


0.052 


0.074 




0=2.0 


0.104 


0.062 


0.058 


0.066 


0.110 




0=4.0 


0.124 


0.058 


0.034 


0.056 


0.168 




0=6.0 


0.148 


0.064 


0.066 


0.064 


0.196 
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Table 3 

Mean bias by different distributions of v and different levels of overdispersion based on 500 

replications 



true 








likelihood 




non-likclihood 




dist'n of u 


parm 


/3i 


GM 


LN 


IG 


RS 


QL 


GM 


r=0.5 


0.3 


0.005 


0.005 


0.004 


0.005 


0.005 






0.5 


0.006 


0.004 


0.003 


0.006 


0.006 




T=1.0 


0.3 


0.000 


0.000 


-0.004 


0.000 


0.000 






0.5 


0.001 


-0.001 


-0.007 


0.001 


0.001 




T=2.0 


0.3 


0.011 


0.006 


0.007 


0.011 


0.011 






0.5 


0.010 


0.005 


0.007 


0.010 


0.010 




T=4.0 


0.3 


-0.013 


-0.041 


-0.062 


-0.013 


-0.013 






0.5 


0.005 


-0.030 


-0.077 


0.005 


0.005 




T=6.0 


0.3 


-0.015 


-0.037 


-0.074 


-0.015 


-0.015 






0.5 


0.004 


-0.034 


-0.105 


0.004 


0.004 


LN 


0-2 = 0.405 


0.3 


0.001 


0.001 


0.001 


0.001 


0.001 






0.5 


-0.002 


-0.001 


-0.002 


-0.002 


-0.002 




0-2 = 0.693 


0.3 


0.002 


0.001 


0.002 


0.002 


0.002 






0.5 


0.001 


0.000 


0.001 


0.001 


0.001 




0-2 = 1.098 


0.3 


0.001 


0.002 


0.006 


0.001 


0.001 






0.5 


0.002 


0.007 


0.006 


0.002 


0.002 




0-2 = 1.609 


0.3 


0.003 


0.009 


0.006 


0.004 


0.004 






0.5 


-0.006 


-0.003 


0.002 


-0.004 


-0.004 




0-2 = 1.946 


0.3 


-0.005 


0.003 


0.002 


-0.003 


-0.003 






0.5 


-0.006 


0.006 


0.002 


0.006 


0.006 


IG 


a=0.5 


0.3 


0.002 


0.000 


0.000 


0.002 


0.002 






0.5 


0.001 


0.000 


0.000 


0.001 


0.001 




a=1.0 


0.3 


-0.003 


-0.004 


-0.004 


-0.003 


-0.003 






0.5 


0.001 


0.001 


0.006 


0.001 


0.001 




a=2.0 


0.3 


0.011 


0.006 


0.000 


0.011 


0.011 






0.5 


0.010 


0.005 


0.007 


0.010 


0.010 




a=4.0 


0.3 


0.011 


0.002 


0.001 


0.011 


0.011 






0.5 


0.010 


-0.002 


0.000 


0.010 


0.010 




a=6.0 


0.3 


0.013 


0.009 


0.012 


0.014 


0.014 






0.5 


0.010 


0.006 


0.008 


0.013 


0.013 
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Table 4 

The mean of standard errors of 0i (standard deviation of $i's) by different distributions of 
V and different levels of overdispersion based on 500 replications for /3i = 0.3 



true 






likelihood 




non-likelihood 




dist'n of u 


parm 


GM 


LN 


IG 


RS 


QL 


GM 


r=0.5 


0.086 


0.088 


0.088 


0.086 


0.087 






(0.088) 


(0.090) 


(0.089) 


(0.088) 


(0.088) 




T=1.0 


0.107 


0.112 


0.111 


0.107 


0.103 






(0.106) 


(0.112) 


(0.111) 


(0.106) 


(0.106) 




T=2.0 


0.139 


0.153 


0.147 


0.138 


0.122 






(0.141) 


(0.152) 


(0.149) 


(0.141) 


(0.141) 




T=4.0 


0.188 


0.219 


0.201 


0.187 


0.147 






(0.181) 


(0.209) 


(0.195) 


(0.181) 


(0.181) 




T=6.0 


0.226 


0.274 


0.238 


0.223 


0.161 






(0.219) 


(0.269) 


(0.238) 


(0.219) 


(0.219) 


LN 


0-2 = 0.405 


0.084 


0.085 


0.085 


0.086 


0.085 






(0.087) 


(0.086) 


(0.085) 


(0.087) 


(0.087) 




0-2 = 0.693 


0.098 


0.101 


0.101 


0.105 


0.097 






(0.106) 


(0.100) 


(0.102) 


(0.108) 


(0.108) 




0-2 = 1.098 


0.115 


0.121 


0.120 


0.134 


0.112 






(0.136) 


(0.120) 


(0.121) 


(0.136) 


(0.136) 




0-2 = 1.609 


0.135 


0.143 


0.139 


0.172 


0.128 






(0.179) 


(0.145) 


(0.149) 


(0.182) 


(0.182) 




0-2 = 1.946 


0.148 


0.157 


0.157 


0.199 


0.138 






(0.206) 


(0.163) 


(0.164) 


(0.213) 


(0.213) 


IG 


a=0.5 


0.084 


0.085 


0.085 


0.086 


0.085 






(0.090) 


(0.086) 


(0.086) 


(0.090) 


(0.090) 




a=1.0 


0.100 


0.103 


0.103 


0.107 


0.099 






(0.109) 


(0.107) 


(0.103) 


(0.109) 


(0.109) 




a=2.0 


0.121 


0.128 


0.125 


0.138 


0.116 






(0.142) 


(0.131) 


(0.131) 


(0.142) 


(0.142) 




a=4.0 


0.149 


0.159 


0.159 


0.186 


0.137 






(0.188) 


(0.164) 


(0.156) 


(0.188) 


(0.188) 




a=6.0 


0.168 


0.180 


0.179 


0.220 


0.149 






(0.229) 


(0.185) 


(0.182) 


(0.230) 


(0.230) 
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Table 5 

Empirical power by different distributions of v and different levels of overdispersion based 

on 500 replications 



true 






likelihood 




non-likelihood 




dist'n of u parm 


01 


GM 


LN 


IG 


RS 


QL 


GM T=0.5 


0.3 


0.94 


0.93 


0.93 


0.94 


0.94 




0.5 


1.00 


1.00 


1.00 


1.00 


1.00 


T=1.0 


0.3 


0.81 


0.77 


0.77 


0.82 


0.82 




0.5 


1.00 


1.00 


1.00 


1.00 


1.00 


T=2.0 


0.3 


0.59 


0.49 


0.49 


0.59 


0.68 




0.5 


0.95 


0.88 


0.87 


0.95 


0.97 


T=4.0 


0.3 


0.31 


0.22 


0.21 


0.33 


0.51 




0.5 


0.77 


0.58 


0.57 


0.76 


0.86 


T=6.0 


0.3 


0.24 


0.14 


0.14 


0.24 


0.43 




0.5 


0.61 


0.38 


0.38 


0.61 


0.82 


LN 0-2 ^ 0.405 


0.3 


0.95 


0.95 


0.94 


0.94 


0.94 




0.5 


1.00 


1.00 


1.00 


1.00 


1.00 


0-2 = 0.693 


0.3 


0.85 


0.86 


0.85 


0.78 


0.85 




0.5 


1.00 


1.00 


1.00 


0.99 


1.00 


0-2 = 1.098 


0.3 


0.73 


0.71 


0.70 


0.62 


0.75 




0.5 


0.99 


0.99 


0.99 


0.96 


0.99 


0-2 = 1.609 


0.3 


0.59 


0.55 


0.56 


0.45 


0.64 




0.5 


0.95 


0.95 


0.93 


0.83 


0.93 


0-2 = 1.946 


0.3 


0.52 


0.48 


0.48 


0.35 


0.56 




0.5 


0.85 


0.89 


0.88 


0.71 


0.86 


IG a=0.5 


0.3 


0.94 


0.94 


0.93 


0.94 


0.94 




0.5 


1.00 


1.00 


1.00 


1.00 


1.00 


a=1.0 


0.3 


0.82 


0.81 


0.80 


0.79 


0.83 




0.5 


1.00 


1.00 


1.00 


0.99 


1.00 


a=2.0 


0.3 


0.68 


0.65 


0.66 


0.60 


0.71 




0.5 


0.97 


0.97 


0.97 


0.95 


0.98 


a=4.0 


0.3 


0.54 


0.48 


0.50 


0.40 


0.60 




0.5 


0.89 


0.89 


0.90 


0.78 


0.89 


a=6.0 


0.3 


0.48 


0.42 


0.44 


0.32 


0.56 




0.5 


0.79 


0.79 


0.80 


0.64 


0.83 
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Table 6 

Rates of choosing the true models based on -2 log likelihood values and 500 replications 



true assumed dist'n of v 



distribution of z/ 


pajrameter 


/3i 


GM versus LN 


GM versus IG 


GM 


r=0.5 


0.3 


0.85 


0.83 






0.5 


0.87 


0.84 




r=1.0 


0.3 


0.96 


0.95 






0.5 


0.98 


0.96 




r=2.0 


0.3 


0.99 


0.99 






0.5 


1.00 


1.00 




r=4.0 


0.3 


1.00 


1.00 






0.5 


1.00 


0.99 




r=6.0 


0.3 


1.00 


0.99 






0.5 


1.00 


0.99 


distribution of u 


parameter 


/3i 


LN versus GM 


LN versus IG 


LN 


= 0.405 


0.3 


0.78 


0.44 






0.5 


0.77 


0.43 




= 0.693 


0.3 


0.88 


0.52 






0.5 


0.88 


0.53 




0-2 = 1.098 


0.3 


0.95 


0.59 






0.5 


0.97 


0.60 




= 1.609 


0.3 


0.95 


0.59 






0.5 


0.96 


0.60 




0-2 = 1.946 


0.3 


0.95 


0.59 






0.5 


0.94 


0.64 


distribution of u 


parameter 


/3i 


IG versus GM 


IG versus LN 


IG 


a=0.5 


0.3 


0.77 


0.68 






0.5 


0.79 


0.67 




a=1.0 


0.3 


0.91 


0.72 






0.5 


0.92 


0.75 




a=2.0 


0.3 


0.98 


0.81 






0.5 


0.97 


0.81 




a=4.0 


0.3 


0.99 


0.89 






0.5 


0.99 


0.87 




«=6.0 


0.3 


0.99 


0.95 






0.5 


0.99 


0.94 
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